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We present a number of new results on one of the most extensively studied topics in computational 
geometry, orthogonal range searching. All our results are in the standard word RAM model: 
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1. We present two data structures for 2-d orthogonal range emptiness. The first achieves O(nlglgn) 
space and 0(lg Ign) query time, assuming that the n given points are in rank space. This improves 
the previous results by Alstrup, Brodal, and Rauhe (FOCS'00), with 0(n \g E n) space and 0(lg lg n) 
query time, or with O(nlglgn) space and 0(lg 2 lgn) query time. Our second data structure uses 
0{n) space and answers queries in 0(lg e n) time. The best previous (9(n)-space data structure, due 

\iy . to Nekrich (WADS'07), answers queries in 0(lg n/ lg lg n) time. 

2. We give a data structure for 3-d orthogonal range reporting with 0(n lg 1+e n) space and 0(lg lg n + 
' k) query time for points in rank space, for any constant e > 0. This improves the previous results by 

Afshani (ESA08), Karpinski and Nekrich (COCOON'09), and Chan (SODA 11), with 0(nlg 3 n) 
space and 0(lg lg n + k) query time, or with Oin lg 1+e n) space and 0(lg 2 lg n + k) query time. 
Consequently, we obtain improved upper bounds for orthogonal range reporting in all constant di- 
mensions above 3. 

Our approach also leads to a new data structure for 2-d orthogonal range minimum queries with 
\ 0(n \g e n) space and 0(lg lg n) query time for points in rank space. 

3. We give a randomized algorithm for 4-d offline dominance range reporting/emptiness with running 
time 0(n\gn) plus the output size. This resolves two open problems (both appeared in Preparata 
and Shamos' seminal book): 

(a) given a set of n axis-aligned rectangles in the plane, we can report all k enclosure pairs (i.e., 
pairs (ri,r2) where rectangle r\ completely encloses rectangle r-i) in 0(n\gn + k) expected 
time; 

(b) given a set of n points in 4-d, we can find all maximal points (points not dominated by any 
other points) in 0(n lg n) expected time. 

The most recent previous development on (a) was reported back in SoCG'95 by Gupta, Janardan, 
Smid, and Dasgupta, whose main result was an 0([n lg n + k] lg lg n) algorithm. The best previous 
result on (b) was an 0(n lg n lg lg n) algorithm due to Gabow, Bentley, and Tarjan — from STOC'84! 
As a consequence, we also obtain the current-record time bound for the maxima problem in all 
constant dimensions above 4. 
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1 Introduction 



We revisit one of the most fundamental and well-studied classes of problems in computational geometry, or- 
thogonal range searching. The goal of these problems is to preprocess a set of n input points in ti-dimensional 
space such that one can efficiently aggregate information about the points contained in an axis-aligned query 
rectangle or box. The most typical types of information computed include counting the number of points, 
computing their semigroup or group sum, determining emptiness, and reporting the points in the query range. 
These problems have been studied extensively for more than three decades, yet many questions have remained 
unresolved. See e.g. flU M EU E M E M M M M MMMMMMMMME&MMMM 
for just a fraction of the vast amount of publications on orthogonal range searching. 

Recent papers E [3.1 have made progress on the pointer machine model and I/O model. In this paper, we 
study orthogonal range searching in the standard word RAM model, which is arguably the most natural and 
realistic model of computation to consider in internal memory. We obtain the best RAM upper bounds known 
to date for a number of problems, including: 2-d orthogonal range emptiness, 3-d orthogonal range reporting, 
and offline 4-d dominance range reporting. 

1.1 Range Searching Data Structures 

In what follows, when stating data structure results, we assume that all input point sets are in rank space, 
i.e., they have coordinates on the integer grid [n] d = {0, . . . , n — l} d . This assumption is for convenience 
only: in a w-bit word RAM when all coordinates are in [U] d with U = 2 W , we can always reduce to the 
rank-space (U = n) case by adding to the query time bound a term proportional to the cost of predecessor 
search |[55l . which is e.g. OilgXgU) by van Emde Boas trees [59] or 0(\g w n) by fusion trees QUI . After 
rank space reduction, all the algorithms mentioned use only RAM operations on integers of 0(lg n) bits. (The 
predecessor lower bound holds even for range emptiness in 2-d, so the additive predecessor cost in the upper 
bound is optimal.) 

Range reporting in 2-d. The most basic version of orthogonal range searching is perhaps range reporting 
in 2-d (finding all points inside a query range). Textbook description of range trees [56 1 implies a solution 
with 0(n\gn) space and Oilgn + k) query time, where k denotes the output size of the query (i.e., the 
number of points reported). Surprisingly, the best space-query bound for this basic problem is still open. 
Chazelle lfl9l gave an 0[n) -space data structure with 0(lg n + k lg £ n) query time, which has been reduced 
slightly by Nekrich lIBTIl to 0(lgn/lglgn + k\g £ n). (Throughout the paper, e > denotes an arbitrarily 
small constant.) Overmars [52] gave a method with O(nlgn) space and 0(lglgn + k) query time. This 
query time is optimal for 0{n \g°^ n)-space structures in the cell probe model (even for range emptiness 
in the rank-space case), by reduction from colored predecessor search [55 1. Alstrup, Brodal and Rauhe [|8] 
presented two solutions, one achieving 0(n lg £ n) space and optimal 0(lg lg n + k) query time, and one with 
0(n lg lg n) space and 0(lg 2 lg n+k lg lg n) query time, both improving Chazelle's earlier data structures lfl9l 
with the corresponding space bounds. In Section [2j we present two new solutions. Our first solution achieves 
0(n lg lg n) space and 0((1 + k) lg lg n) query time, thus strictly improving Alstrup et al.'s second structure. 
Secondly, we present an 0(n)-space data structure with query time 0((1 + k) \g e n), significantly improving 
the first term of Nekrich's result. 

We can also solve range emptiness in 2-d (testing whether a query rectangle contains any input point) 
by setting k = 0. Here, our results are the most attractive, improving on all previous results. For example, 
our method with O(ralglgn) space has optimal O(lglgn) query time, and simultaneously improves both of 
Alstrup et al.'s solutions (0(n lg £ n) space and 0(lg lg n) time, or 0(n lg lg n) space and 0(lg 2 lg n) time). 
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Range reporting in 3-d. By a standard reduction, Alstrup et al.'s first 2-d result directly implies a data 
structure for 3-d orthogonal range reporting with space 0(n\g 1+e n) and query time 0(lgn + k); this im- 
proved an already long chain of previous work. Nekrich [50 1 was the first to achieve sublogarithmic query 
time for 3-d orthogonal range reporting: his data structure has 0(nlg 4 n) space and 0(lg 2 lgn + k) query 
time. Afshani [ Q subsequently improved the space to 0{n lg 3 n) while maintaining the same 0(lg 2 lgn + k) 
query time. Karpinski and Nekrich [40] later reduced the space to 0(n lg 1+e n) at the cost of increasing the 
query time to 0(lg 3 lgn + k), by borrowing ideas of Alstrup et al. [8 |. In these methods by Afshani [[TJ and 
Karpinski and Nekrich [40], two of the lglgn factors come from orthogonal planar point location. By using 
the most recent result on orthogonal point location by Chan iffll . one of the lg lg n factors can automatically 
be eliminated in all of these time bounds. This still leaves the query time of Karpinski and Nekrich's structure 
at 0(lg 2 lgn + k), however. In Section [3l we present a new method with 0(nlg 1+e n) space and optimal 
0(lg lg n + k) query time, simultaneously improving all previous methods that have linear dependence in 

Range reporting in higher dimensions. By a standard reduction, the previous 3-d results [Q] |40] [14] imply 
data structures for d-dimensional orthogonal range reporting with 0(n lg d n) space and 0(lg rf_3 n lg lg n + k) 
query time, or 0(n lg d ~ 2+£ n) space and 0((lg n/ lg lg n) d ~ 3 lg 2 lg n + k) query time for d > 4. Our result 
implies a ci-dimensional data structure with 0(n lg d ~ 2+£ n) space and 0((lg n/ lg lg n) d ~ 3 lg lg n + k) query 
time. This query bound is the best known among all data structures with 0(n \g°^ n) space; our space bound 
is the best known among all data structures with 0(lg°^ n + k) query time. 

The 4-d case is especially nice, as we get 0(n lg 2+£ n) space and 0(lgn + k) query time. This query 
time almost matches Patra§cu's Q(lgn/ lglgn) lower bound ll54l for 0{n\g ^ n)-space structures in the 
cell probe model for 4-d emptiness. 

Range minimum in 2-d. Our 3-d range reporting method can also be modified to give a new result for the 
2-d range minimum query problem (see the appendix), with 0(n lg £ n) space and 0(lg lg n) query time. 

1.2 Offline Range Searching 

Finally, in Section [4j we turn to offline (or batched) versions of orthogonal range searching where all queries 
are given in advance; the goal is to minimize the total time needed to answer all queries, including preprocess- 
ing. Offline problems are important, as efficient algorithms are often obtained through the use of efficient data 
structures in offline settings. Offline problems also raise new challenges, beyond simply the issue that pre- 
processing times sometimes get ignored in analysis of data structures in the literature. Interestingly, the com- 
plexity of offline problems may be fundamentally different from their online counterparts: examples include 
predecessor search (where the offline problem is related to integer sorting and can be solved in 0{^\g lgn) 
expected time per query 11371 "). orthogonal 2-d range counting (where recently Chan and Patra§cu [16 ] have 
obtained an offline 0{^J\gn) bound per query, better than the online 0(lg n/ lg lg n) bound), and nonorthog- 
onal 2-d point location (where Chan and Patra§cu [ 17 1 have obtained an offline 2°^ lg lgn ) bound, better than 
the current online 0(lgn/ lglgn) or 0{^f\g U / lg lg U) bound [15]). 

Offline dominance reporting in 4-d and the rectangle enclosure problem in 2-d. Our main result on 
offline range searching is a new algorithm for the offline 4-d dominance reporting problem: given n input 
points and n query points, report for each query point q all input points that are dominated by q. Here, 
p = (xi, . . . , Xd) is dominated by q = (ai, . . . , a^) iff xi < ai for every i, i.e., p lies inside the d-sided range 

1 As Karpinski and Nekrich 1 4-0 1 observed, space can be slightly reduced to 0(n lg n lg ^ 1 ) lg n) if one is willing to give up linear 
dependence in k, with query time 0(lg 2 lg n + k lg lg n). 
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(— oo, ax] X • • • X (—00, ad] (an orthant). In other words, given n red points and n blue points, we want to 
report all pairs (p, q) where the red point p is dominated by the blue point q. We give a randomized algorithm 
that solves this problem in 0(n lg n + k) expected time in 4-d, where k denotes the total output size. 

(Note that the best known online data structure for 4-d dominance reporting with 0(lg n + k) query time 
requires 0(n lg 1+e n) space and preprocessing time at least as big, and thus is not applicable here.) 

In the literature, offline 4-d dominance reporting is studied under the guise of the 2-d rectangle enclosure 
problem: given n rectangles in 2-d, report all pairs (n, T2) where rectangle n completely encloses rectangle 
T2- By mapping each rectangle to a point in 4-d, it is easy to see that the problem reduces to offline 4-d 
dominance reporting (in fact, it is equivalent to dominance reporting in the "monochromatic" case, where 
we equate the query point set with the input point set). This classical problem has the distinction of being 
the last problem covered in Preparata and Shamos' standard textbook [56]. In the early 1980s, Vaishnavi 
and Wood [58] and Lee and Preparata P31 both gave 0(n lg 2 n + fc)-time algorithms. The main result of a 
SoCG'95 paper by Gupta et al. [36] was an OQnlgn + k] lglgn)-time algorithm. An alternative algorithm 
by Lagogiannis et al. PRll obtained the same time bound. A number of researchers (the earliest seems to be 
Bentley and Wood [12]) raised the question of finding an 0(n lg n+/c)-time algorithm. Particularly frustrating 
is the fact that obtaining 0(n Ign + k) time is easy for the similar-sounding rectangle intersection problem 
(reporting all pairs of intersecting rectangles). Our new randomized algorithm shows that the 2-d rectangle 
enclosure problem can be solved in 0(n lg n + k) time, finally resolving a 3-decades-old question. 

By a standard reduction, our result implies a randomized 0(nlg d ~ 3 n + fc)-time algorithm for offline 
dominance reporting for any constant dimension d > 4. 

Offline dominance emptiness and the maxima problem. Our algorithm can also solve the offline domi- 
nance emptiness problem in 0(n lg d ~ 3 n) expected time for any constant d > 4: here, given n input points 
and n query points, we want to decide for each query point q whether some input point is dominated by q. 

A notable application is the maxima problem: given n points, identify all maximal points, i.e., points that 
are dominated by no other point. Like its cousin, the convex hull problem, this problem plays a fundamental 
role in computational geometry and is often used as examples to illustrate basic algorithmic techniques. It 
has many applications and is related to concepts from other fields (e.g., skyline queries in databases and 
Pareto optimality in economics). The earliest result for dimensions d > 3 was Kung, Luccio, and Preparata's 
0{n lg d " 2 n)-time algorithm g2|56l from 1975. While progress has been made on probabilistic results for 
random point sets |[TTll27ll34l . output-sensitive results ETIRTI . and even instance-optimal results H, the best 
worst-case result for the maxima problem has remained the one from Gabow, Bentley, and Tarjan's classic 
STOC'84 paper ll33l . (That paper is well remembered for introducing Cartesian trees.) Gabow et al.'s time 
bound is 0(n lg d ~ 3 nig lgn) for d > 4. Our (randomized) result implies the first improvement in two and a 
half decades: 0(n lg d ~ 3 n). In particular, we obtain the first 0{n lg n) algorithm for the 4-d maxima problem. 

Other applications. Our offline dominance result also leads to the current best results for other standard 
problems (see the appendix), such as bichromatic Loo-closest pair and -minimum spanning tree for d > 4. 

Organization. In the three subsequent sections, we describe our new methods for 2-d orthogonal range 
reporting, 3-d orthogonal range reporting, and offline 4-d dominance range reporting. These sections are 
independent of each other and can be read separately. Interestingly, our techniques for 2-d range reporting are 
not based on Alstrup et al.'s previous grid-based approach [8] but draw on new ideas related to succinct data 
structures. Our 3-d range reporting structure is based on Alstrup et al.'s approach, but with new twists. Finally, 
our 4-d offline algorithm involves an unusual (and highly nonobvious) mixture of bit-packing techniques lfl6l 
and classical computational geometric tools (Clarkson-Shor-style random sampling). 
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2 Range Reporting in 2-d 



The goal of this section is to prove: 

Theorem 2.1. For any 2 < B < \g e n, we can solve 2-d orthogonal range reporting in rank space with: 
0(n-B lg lgn) space and 0(lg lg n+k-\g B \gn) query time; or 0{n-\g B lgn) space and {l + k)-0{B lg lg n) 
query time. 

At the inflection point of the two trade-offs, we get a data structure with space 0(n lg lg n) and query time 
(1 + k) ■ 0(lg lg n). At one extreme point, we have space 0(n) and query time (1 + k) ■ 0(lg e n). At the other 
extreme, we have space 0(n lg £ n) and query time 0(lg \gn-\- k), thus matching the bounds of Alstrup et al. 
Note that all these tradeoffs also apply to emptiness with k set to 0. This can be seen through a black-box 
reduction: Assume a reporting data structure with query time t\ + t2k is available. Given an emptiness query, 
run the query algorithm on the reporting data structure using the same query. If the query algorithm terminates 
within ti computation steps, we immediately get the answer, otherwise we terminate after t\ + 1 operations, 
at which point we know k > and thus we know the range is nonempty. 

We will describe a linear-space reduction from 2-d orthogonal range reporting to the following "ball 
inheritance" problem, a pointer-chasing-type problem reminiscent of fractional cascading ll24l . Consider a 
perfect binary tree with n leaves. Also consider n labelled balls, which appear in an ordered list at the root 
of the tree. We imagine distributing the balls from the root down to the leaves, in lg n steps. In the i-th step, 
a node on level lgn — i contains a subset of the balls in an ordered list, where the order is the same as the 
original order at the root. Each ball chooses one of the two children of the node and is "inherited" by that 
child. The number of balls in each node across a level is the same. That is, on level i, each node contains 
exactly 2* balls, and each leaf contains exactly one ball. 

Given the inheritance data described above, the goal is to build a data structure that answers the following 
type of query: given a node and an index into its list of balls, what leaf does the indicated ball eventually reach? 
We may imagine each ball as having lg n copies, one at each node on its root-to-leaf path. Conceptually, each 
ball stores a pointer to its copy on the level below. The identity of a ball on level i consists of a node at level i, 
and the index of the ball in the node's list. The goal is, given (the identity of) a ball on some level, to traverse 
the pointers down to the tail of the list, and report the leaf's identity. 

In Section I2TT1 we give space/time trade-offs for the problem with results parallel to Theorem 12.11 These 
trade-offs come out naturally given the definition of ball inheritance: our data structure mimicks skip lists on 
n independent lists with the copies of each ball. In Section l22l we give a reduction from 2-dimensional range 
reporting to this abstract ball-inheritance problem. 

2.1 Solving the Ball-Inheritance Problem 

This subsection will prove: 

Lemma 2.2. For any 2 < B < lg 6 n, we can solve the ball-inheritance problem with: (I) space 0(nB lg lg n) 
and query time 0(lg B lg n); or (2) space 0(n lg B lg n) and query time 0(B lg lg n). 

Using standard techniques, one can represent the pointers on each level of the tree with 0(n) bits such 
that we can traverse a pointer in constant time. This uses the rank problem from succinct data structures: 
represent a bit vector A[l . . n] using 0(n) bits, to answer rank(fc) = J2i<k ^ n f act solutions with very 
close to n bits of space and constant query time are known ll53l . For every node, we store a solution to the 
rank problem among its balls, where "0" denotes a ball going to the left child, and "1" a ball going to the right 
child. The index of a ball in the right child is rank(i) evaluated at the parent. The index of ball i in the left 
child is i — rank(i). 
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This trivial data structure uses O (nigra) bits in total, or 0(n) words, but has O(lgra) query time. For 
faster queries, the query will need to skip many levels at once. We use an easy generalization of rank queries, 
which we prove in the appendix: 

Lemma 2.3. Consider an array A[l..n] with elements from some alphabet E. We can construct a data 
structure of 0(ralg S) bits which answers in constant time rank(fc) = the number of elements in A[l . . k] 
equal to A[k]. 

In our context, the lemma implies that we can store all pointers from balls at level i to balls at level i + A 
using O(raA) bits of space. Indeed, each ball can be inherited by 2 A descendants of its current node (the 
alphabet E will denote this choice of the descendant A levels below). To compute the index of a ball in the 
list of its node at level i + A, we simply have to count how many balls before it at level i go to the same 
descendant (a rank query). 

For intuition of how to use this building block, consider an abstract problem. We need to augment a linked 
list of rra nodes in a manner similar to a skip list. Any node is allowed to store a pointer A nodes ahead, but 
this has a cost of A. The goal is to reach the tail of the list from anywhere in a minimal number of hops, 
subject to a bound on the total cost of the skip pointers. For any 2 < B < rra, we can solve this problem as 
follows: 

• Traversal time 0(lg B m) with pointer cost 0(m-B lg B rra). Define the level of a node to be the number 
of trailing zeros when writing the node's position in base B. Each node on level i stores a pointer to the 
next node on level i + 1, or to the tail if no such node exists. The cost of a node at level i is 0(B l+1 ), 
which is 0(-pr-B i+1 ) = 0{mB) across a level. The traversal needs to look at 0(lg B rra) pointers (one 
per level) before reaching the tail. 

• Traversal time 0{B \g B rra) with pointer cost 0(m • lg B rra). Each node on level i stores a pointer that 
skips B % nodes, or to the tail if no such node exists. In other words, each node on level % stores a pointer 
to the next node on level i or higher (whichever comes first). The cost of a node on level i is 0{B % ), 
so all nodes on level i cost 0(j~B l ) = O(rra). The total cost is thus 0(m lg B rra). We can reach the 
tail from anywhere with 0(B lg B rra) pointer traversals, since we need at most B nodes on each level, 
before reaching a node on a higher level. 

Returning to the ball-inheritance problem, we will implement the above strategies on the n lists of copies 
of each ball, using Lemma [231 to store pointers. We use the first strategy in the regime of fast query time, but 
higher space (tradeoff (1) in Lemma 1272b . Nodes on levels of the tree that are a multiple of B l store pointers 
to the next level multiple of B i+1 . This costs 0(B i+1 ) bits per ball, so the total cost is ^ • 0(B i+1 ) = 
0(lg n ■ B ■ \g B lg ra) bits per ball. This is 0{nB \g B lg ra) words of space. The query time is 0(lg B lg ra), 
since in each step, we jump from a level multiple of B % to a multiple of B i+1 . Since the bound is insensitive to 
polynomial changes in B, the trade-off can be rewritten as: space 0(nB lg lg n) and query time 0(lg B lg ra). 

The second strategy gives low space, but slower query, i.e. tradeoff (2) in Lemma fL2l Nodes on levels 
that are a multiple of B % store pointers to B % levels below (or to the leaves, if no such level exists). The cost of 
such a level is 0(B l ) bits per ball, so the total cost is ^ ^ • 0{B l ) = 0(lgra • \g B lgra) bits per ball. This 
is space 0{n\g B lgra) words. The query time is 0(B lg B lgra), since we need to traverse at most B levels 
that are multiples of B % before reaching a level multiple of B l+1 . Thus, we obtain query time 0{B lglgn) 
with space O (ra \g B lg ra) . 

2.2 Solving Range Reporting 

This subsection will show: 
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Lemma 2.4. If the ball inheritance problem can be solved with space S and query time r, 2-d range reporting 
can be solved with space 0(S + n) and query time O ( lg lg n + (1 + k) • r) . 

Consider n points in 2-d rank space; we may assume n to be a power of two. We build a perfect binary 
tree over the x-axis. Each ball will represent a point, and the leaf where the ball ends up corresponds to its x 
coordinate. The order of balls at the root is the sorted order by y coordinate. We store a structure for this ball 
inheritance problem. The true identity of the points (their x and y coordinates) are only stored at the leaves, 
taking linear space. We will now describe additional data structures that allow us to answer range reporting 
with query time O(lglgn), plus (O(l) + k) ball inheritance queries. 

Our first ingredient is a succinct data structure for the range minimum problem (RMQ). Consider an array 
A of n keys (which can be compared). The query is, given an interval [i, j], report the index of the minimum 
key among A[i], . . . , A\j]. Note that a data structure for this problem does not need to remember the keys. 
Information theoretically, the answer is determined if we know the Cartesian tree [33 1 of the input; a tree 
takes just 2n bits to describe. Effective data structures matching this optimal space bound are known. See, for 
example, [29], which describes a data structure with 2n + o(n) bits of space and 0(1) query time. 

In each node that is the right child of its parent, we build a succinct RMQ data structure on the points 
stored in the subtree rooted at that node. In this structure, we use the y rank of the points as indices in the 
array, and their x coordinates as keys. In each node that is a left child of its parent, we build a range-maximum 
data structure (equivalently, an RMQ data structure on the mirrored input). Since each data structure takes a 
number of bits linear in the size of the node, they occupy a total of 0(n lg n) bits, i.e. O(n) words of space. 

To report points in the range [xi, X2] x [y\,y2\, we proceed as follows: 

1. Compute the lowest common ancestor LCA(xi, X2) in the perfect binary tree. This is a constant time 
operation based on the xor of x\ and X2- the number of zero bits at the end indicates the height of the 
node, and the rest of the bits indicate the nodes identity. (For instance, we can use an array of n entries 
to map the x\ © X2 to the right node.) 

2. We convert [2/1,2/2] into the rank space of points inside the left and right child of LCA(xi, X2). This 
entails finding the successor y\ of y% among the y values of the points under the two nodes, and the 
predecessor j/2 of V2- See below for how this is done. 

3. We descend to the right child of LCA(xi, X2). Using the RMQ data structure, we obtain the index 
m (the y rank) of the x-minimum point in the range [yi , 1/2] . We use the ball-inheritance structure to 
find the leaf of this point. We retrieve the x coordinate of the point from the leaf, and compare it to 
X2- If greater, there is no output point in the right node. If smaller, we report this point and recurse in 
[yi,m — 1] and [m + 1, 2/2] to report more points. We finally apply the symmetric algorithm in the left 
child of LCA(xi, X2), using the range maxima until the points go below x\. 

The cost of step 3 is dominated by the queries to the ball-inheritance problem. The number of queries is two 
if the range is empty, and otherwise at most twice the number of points reported in each child of the LCA. 

We now describe how to support step 2 in 0(t + lg lg n) time, with just O(n) space in total. We will use 
a succinct index for predecessor search. Consider supporting predecessor queries in a sorted array A[l . . n] 
of w-bit integers. If we allow the query to access entries of the array through an oracle, it is possible to 
obtain a data structure of sublinear size. More precisely, one can build a data structure of 0(nlgw) bits, 
which supports queries in 0(lg w) time plus oracle access to 0(1) entries; see ll35l Lemma 3.3]. (This idea is 
implicit in fusion trees [32 1, and dates further back to d.) 

We build such a data structure on the list of y coordinates at each node. The oracle access to the original 
y coordinates is precisely what the ball-inheritance problem supports. Since our points have coordinates of 
O(lgn) bits, each data structure uses O(lglgn) bits per point, so the total space is 0(n lglgn) words. To 
reduce the space to linear, we store these predecessor structures only at levels that are a multiple of lglgn. 
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From LCA(xi, #2), we go up to the closest ancestor with a predecessor structure. We run predecessor queries 
for yi and 772 at that node, which take 0(lg lg n) time, plus 0(1) queries to the ball-inheritance problem. Then, 
we translate these predecessors into the rank space of the left and right child of LCA(xi, X2), by walking down 
at most lg lg n levels in the ball-inheritance problem (with constant time per level). This concludes the proof 
of Lemma l2.4l which combined with Lemma l2T2l proves Theorem 12. II 

3 Range Reporting in 3-d 

In this section, we present a new data structure for 3-d orthogonal range reporting. We find it more convenient 
now to ignore the default assumption that points are given in rank space. The special case of dominance (i.e., 
3-sided) reporting can already be solved with 0(n) space and 0(lglg U + k) time by known methods Q, 
using the latest result on orthogonal planar point location lfl4l . We will show how to "add sides" without 
changing the asymptotic query time and without increasing space by too much. 

3.1 The 3-d 4-Sided Problem 

Our method is based on a simple variant of Alstrup, Brodal, and Rauhe's grid-based method [8]. Instead of 
a grid of dimension near ^fn x ^fn as used by Alstrup et al.'s and Karpinski and Nekrich's method BOl . our 
key idea is to use a grid of dimension near (n/t) x t for a judiciously chosen parameter t. 

Specifically, consider the problem of answering range reporting queries for 4-sided boxes in 3-d, i.e., the 
boxes are bounded in 4 out of the 6 sides where the unbounded sides are from different coordinate axes. 
W.lo.g., assume that query ranges are unbounded from below in the y and z directions. Suppose that there is 
a base data structure for solving the 3-d 4-sided problem with So(n) space in bits and Qo(n, k) query time. 

The data structure. Fix parameters t and C to be set later. Let S be a set of n points in [C/] 3 . Build a 2-d 
grid on the xy-plane with n/(Ct) rows and t columns, so that each row contains Ct points and each column 
contains n/t points H The number of grid cells is n/C. 
We build a data structure for S as follows: 

0. For each of the t columns (in left-to-right order), build a data structure recursively for the points inside 
the column. 

1. For each of the n/(Ct) rows, build a base data structure directly for the points inside the row, with 
So(Ct) space and Qo(Ct,k) query time. Also build a predecessor search structure for the n/(Ct) 
horizontal grid lines. 

2. For each of the t columns, build a 3-sided reporting data structure lH4~l for the points inside the column, 
with 0(n/t) words of space, or O ((n/t) lg U) bits of space, and 0(lg lg U + k) query time. 

3. Let G be the set of at most n/C points formed by taking the z-lowest point out of each nonempty 
grid cell. Build a data structure for G for 4-sided queries using any known method [14] with 

0((n/C) lg°^ n) space and 0(lglg U + k) query time. 

4. Finally, for each nonempty grid cell, store the list of all its points sorted in increasing z-order. 

2 Abusing notation slightly, we will not distinguish between a 2-d region (e.g., a row, column, or grid cell) and its lifting in 3-d. 
For simplicity, we ignore floors and ceilings. 
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Note that by unfolding the recursion, we can view our data structure as a degree-i tree T where the points in 
the leaves are arranged in x-order and each node stores various auxiliary data structures (items 1^1). 
The space usage in bits satisfies the recurrence 

S(n) = tS(n/t) + (n/(Ct))S (Ct) + 0(nlgU + {n/C)\g° {l) n). 

For the base case, we have S[n) = 0(So(Ct)) for n < Ct. Solving the recurrence gives 

S(n) = O(lg t n-[(n/(Ct))S (Ct) + n\gU + (n/C) lg°« n]) 

(assuming that So(n)/n is nondecreasing). The third term disappears by setting C = lg c n for a sufficiently 
large constant c. 

The query algorithm. Suppose we are given a query range q = [xl,xr] x (—00, yo] X (—00, zq] and the 
x-ranks of xl and xr w.r.t. the input point set. Let v be the lowest common ancestor of the two leaves of T 
whose x -range contain xl and xr. We can find v by performing a word operation on the two given x-ranks 
(no special LCA data structures are required since T is perfectly balanced). 

From now on, we work exclusively at node v of the tree. There, q intersects more than one column. Say 
xl and xr are in columns jx and jr (which can be identified in O(l) time as we know the x-ranks). Say yo 
is in row i, computable by predecessor search in 0(lg lg U) time.We can then answer the query as follows: 

1. Let qx be the ("top") portion of q inside row i. Report all points in qx (which is 4-sided) by the base 
data structure at row i. The cost is Qo(Ct, k') if k' denotes the number of points in qx- 

2. Let qx and qR be the portions of q — qx inside columns jx and jr respectively. Report all points in 
qx and qR (which are 3-sided inside columns jx and jr respectively) by the 3-sided data structure at 
columns jx and jr. The cost is 0(lg lg U) plus the number of points in qx and qR. 

3. Let qj be the remaining ("interior") portion q — (qx U qL U qR). Find all points of G in q by querying 
the data structure for G. The cost is at most 0(lg lg U) plus the number of points in qx 

4. For each point seG found in step 3, report all points in s's grid cell with z-values below zq by a linear 
search over the cell's z-sorted list. The cost is linear in the number of points in qj. 

The overall query time is thus Q(n, k) = Qo(Ct, k') + 0(lg lg U + k — k') for some k' < k. 

Bootstrapping. Assume the availability of a solution with So(n) = 0(n lg U+n lg 1+1 /^ n) and Qo(n, k) = 
0(lglg U + k) for a constant I. (For a base case with £ G (0, 1], we can start with any known method with 
0(n Ig ^ n) space and 0(lg lg U + k) query time 03].) Setting t with lg t = lg £ /^ +1 ) n then yields 

S(n) = 0((lgn)/(lgt) • [n\gU + n\g 1+1 ^ Ct}) = 0(n lg U IgV^+i) n ) 

andQ(n,A;) = 0(lg lg U + k). 

We need one last trick: rank space reduction. Initially, store the x-, and y-, and z-values in sorted arrays, 
build predecessor search structures for them, and afterwards, replace all values by their ranks. This way, we 
have reduced U to n, and the space (in bits) of the data structure improves to 0(n lg U +n lg 1+1 /(^ +1 ) n). For a 
query range q, we can initially determine the x-, y-, and 2-ranks of g's endpoints in 0(lg lg U) time [59 ] before 
running the query algorithm. Incidentally, this also fulfills the assumption that the x-ranks of q's endpoints 
are given. After the query, we can recover the x-, y-, and z-values of each reported point by looking up the 
sorted arrays. The query time remains 0(lg lg U + k) (though the constant factor in the k term increases). 

By bootstrapping fl/e] times, we finally obtain a solution for the 3-d 4-sided problem with 0(n lg U + 
n \g l+e n) bits of space, i.e., 0(n lg 6 n) words of space (by packing), and 0(lg lg U + k) query time. 
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3.2 The 3-d 5-Sided/6-Sided Problem 



We can solve the 3-d 5-sided problem in the same way. W.l.o.g., assume that the ranges are unbounded from 
below in the z direction. Item 2 now stores 4-sided data structures, and space increases by a lg e n factor 
only as a result. The query algorithm proceeds similarly. We now have an additional bottom portion qs, but 
It ,Qb, Ql,Qr are all 4-sided, unless q lies completely inside a column (in which case we only need one query 
to a base data structure). Our method thus solves the 3-d 5-sided problem with 0(n lg°^ n) words of space 
and 0(lg lg U + k) query time. 

It is known (e.g., see |[50l ) that the j-sided problem can be reduced to the (j — l)-sided problem by standard 
binary divide-and-conquer, where the space increases by a logarithmic factor but the query time is unchanged 
(if it exceeds lg lg). Thus, we can get the following result for the 3-d general (i.e., 6-sided) problem: 

Theorem 3.1. There is a data structure for 3-d orthogonal range reporting with 0(n\g l+£ n) space (in 
words) and 0(lg lg U + k) query time. 

3.3 Higher dimensions and applications. 

It is known that d-dimensional orthogonal range reporting can be reduced to (d — 1) -dimensional orthogonal 
range reporting by using a range tree with fan-out b, where the space increases by a lg b n factor and 
the query time increases by a lg b n factor. By setting b = lg £ n (and applying rank space reduction at the 
beginning), Theorem l3. 1 l implies: 

Corollary 3.2. There is a data structure for d-dimensional orthogonal range reporting for any constant d > 4 
with 0(n \g d ~ 2+£ n) space and 0((lg n/ lg lg n) d ~ 3 lg lg n + k) query time. 

Our method also works for emptiness queries; the same bounds hold with k set to 0. Here, dominance 
emptiness structures are sufficient in item 2, and item 4 is unnecessary. 

Range minimum queries (finding the point inside a query range with the minimum priority, assuming that 
each input point is given a priority value) are closely related. For example, the decision version of 2-d range 
minimum queries (deciding whether the minimum is at most a given value) reduces to 3-d 5-sided emptiness 
queries. It is no surprise then that we can obtain the same result for 2-d range minimum queries as 3-d 5-sided 
emptiness. Here, in item 2, we need to replace 3-d 3-sided emptiness structures with 2-d dominance range 
minimum structures, but 2-d dominance range minimum reduces to point location in the vertical projection of 
a lower envelope of 3-d orthants. This is an orthogonal 2-d point location problem, which can be solved with 
0(n lg U) space in bits and 0(lg lg U) query time [14]. The same analysis thus carries through. 

Theorem 3.3. There is a data structure for 2-d range minimum queries with 0(n lg e n) space and 0(lg lg U) 
query time. 

In contrast, modifying Karpinski and Nekrich's 3-d range emptiness method [40] yields a data structure 
for 2-d range minimum queries with 0(n lg°^ lg n) space but 0(lg 2 lg n) query time. 

Our method can also give an alternative solution to 2-d orthogonal range reporting with 0(n \g e n) space 
and 0(lg lg U + k) time. This solution is arguably slightly simpler than Alstrup, Brodal, and Rauhe's original 
method (H, as their method requires constant-time 1-d range queries as a subroutine, and also requires a more 
intricate way of handling rank space reduction. 

4 Offline Range Reporting 

In this section, we present our 0(n\gn + k) expected time solution for the offline 4-d dominance reporting 
problem on n query points and n input points, where k denotes the total output size of all n queries. In this 
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section, we fix w = e lg N where N denotes the maximum input size. Any tu-bit word operation we introduce 
can be simulated in 0(1) time by table lookup, after preprocessing in sublinear time 2°^ = N 0( - £ \ 

4.1 Preliminaries 

We begin by describing some key subroutines and tools we need. The first subroutine is an algorithm for a 
special case of offline 2-d orthogonal point location. Chan and Patra§cu [ 16 ] recently studied the offline 2-d 
orthogonal range counting problem and obtained a linear-time algorithm for the case when the number of 
points is smaller than 2 0( -^"\ From this result, they then obtained an 0(n^\gn) algorithm for the general 
case. We apply their bit-packing technique and show that a similar result holds for orthogonal point location 
(see the appendix for the proof): 

Lemma 4.1. There is an algorithm for offline 2-d orthogonal point location on n query points and n disjoint 
axis-aligned rectangles that runs in time 0(n) if n < 2°(^^ and the coordinates have been presorted. 

The second subroutine is a preliminary method for the offline d-dimensional orthogonal range report- 
ing problem. A straightforward 6-ary version of the range tree, combined with a trivial method for the 1-d 
base case, easily gives the following bound, which with the right choice of b will turn out to be crucial in 
establishing our 4-d result: 

Lemma 4.2. There is an algorithm for offline d-dimensional orthogonal range reporting on n points and m 
boxes that runs in time 0(n Igf 1 n + b d ~ 1 m\gft~ 1 n + k), where b > 2 is a parameter and k is the total 
output size, if coordinates have been presorted. 

The main geometric tool we use is a randomized version of shallow cuttings in 3-d. Let 5 be a set of 
n points in 3-d. Pick a random sample R where each point of 5 is included independently with probability 
l/K for a fixed parameter K. Define the staircase polyhedron V(R) to be the lower envelope of the orthants 
O s = [x, oo) x [y,oo) x [z, oo) over all the points s = (x,y,z) G R. Note that the vertices of V(R) 
include all the minimal points of R (and possibly extra points not in R); this orthogonal polyhedron V(R) 
has 0(|i?|) number of vertices (by Euler's formula) and can be computed in time 0(|-R| lg |i?|) by adapting 
a standard algorithm for 3-d minima lf33l 1561 (the time bound can be improved on the RAM). Let VV(R) 
denote the cells in the vertical decomposition of the region underneath V(R). The decomposition is defined 
as follows: take each horizontal face (a polygon) of V(R) and form a 2-d vertical decomposition of the face by 
adding y-vertical line segments at its vertices; finally, extend each resulting subface (a rectangle) downward 
to form a cell touching z = — oo. The decomposition VV(R) has 0(|-R|) size and can be computed in 0(|-R|) 
additional time. For each cell A € W(R), we define its conflict list Sa to consist of all points s£S with O s 
intersecting A; equivalently, 5a consists of all points in 5 that are dominated by the top-upper-right corner 
va of A. The decomposition VV(R) and its conflict lists, which together we refer to as a randomized shallow 
cutting of 5, satisfy some desirable properties: 

Lemma 4.3. For a random sample Rof S with E[|i2|] = n/K, 

(a) niax^ g vx>(_R) |5^| = O(KlgN) with probability at least 1 — 1/N for any N > n; 



(b) E 



Ea 6 vx>(/?) I-SaI = 0(n); 



(c) if a query point q dominates exactly k points of S, then q is covered by VT>(R) (i.e., lies below V(R)) 
with probability at least 1 — k /K. 
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Proof, (a) and (b) follow from the general probabilistic results by Clarkson and Shor [26 28 1; (c) is obvious 
by a union bound: if q is above V(R), then some point of S dominated by q must be chosen in R. □ 

Matousek [48] provided a deterministic version of shallow cuttings satisfying similar, slightly stronger 
properties (without the extra logarithmic factor in (a) and with probability 1 in (c) for k < K). Originally, 
shallow cuttings were developed for halfspace range reporting and defined in terms of arrangements of planes 
rather than orthants; the first application to dominance range reporting was proposed by Afshani [fl]. (A 
similar concept specific to the case of dominance called t-approximate boundary had also appeared [60] [50].) 

For our offline problem, however, preprocessing cost matters and the above simpler randomized version 
is more suitable than its deterministic counterpart. Note that it is not advisable to use shallow cuttings in 4-d 
directly, since the number of vertices in the staircase polyhedron V(R) can be quadratic in \R\ in 4-d. 

4.2 Offline 3-d Dominance Reporting 

We warm up by illustrating how randomized shallow cuttings can help solve the offline dominance reporting 
problem in the 3-d case. The derived solution playes a key role in our 4-d solution. We assume that the given 
n input points and n query points have been pre-sorted. 

Algorithm. We pick a random sample R of the input points, where each point is sampled with probability 
l/K with K := lg n. We first compute V(R) and VV(R). 

We next compute the conflict lists for all the cells of VV(R) as follows. For each input point s, it suffices 
to identify all cells whose conflict lists include s. We first find the cell A € VT>(R) containing s; this reduces 
to a 2-d point location query in the xy-projection of VV(R). The top-upper-right corner va of A gives us an 
initial vertex that dominates s. We observe that all vertices of the polyhedron V(R) that dominate the point s 
form a connected subgraph in the graph (the 1 -skeleton) induced by the polyhedron. Furthermore, the degree 
of each node in the graph is at most 3. Thus we can perform a breadth-first search from the initial vertex found 
to generate all vertices of V(R) dominating v, yielding all conflict lists that include v. The total time over all 
input points v, excluding the initial point location queries, is linear in the total size of all conflict lists. 

For each query point q, we find the cell of W{R) containing q; this again reduces to a 2-d point location 
query in the projection of W{R). If no cell is found (i.e., q is above V(R)), then we say that q is bad; 
otherwise it is good. For each cell A 6 VT>{R), we run an existing algorithm A$ to solve the offline 3-d 
dominance reporting subproblem for the input points in the conflict list of A and the query points inside A. 

This answers all good queries correctly. To finish, we recursively solve the offline 3-d dominance reporting 
problem on the bad queries and all the input points, where the roles of queries and input points are now 
reversed. Note that we also reverse the dominance relation, or equivalently, negate all coordinates. After 
recursing twice, however, we terminate by switching to a known 0(n lgn + /c)-time algorithm (e.g., P31I561 ). 

Analysis. Assume that the offline 2-d point location on n pre-sorted rectangles and query points takes 
0(nQ PL (n)) time for some non-decreasing function Q PL (-)- Assume that the initial algorithm A$ for of- 
fline 3-d dominance reporting on n pre-sorted input and query points takes 0(nQo(n) + k) (expected) time 
for some non-decreasing function Qo(-). Note that this implies that the running time for n input points and 
m query points is 0{{n + m)Qo( n ) + k), by dividing the query points into \m/n\ groups of size at most n 
when m > n. 

Our algorithm spends expected time at most 0((n/K)\gn) = 0(n) to compute V(R) and VV(R). 
Performing point locations on the xy-projection of VV(R) (a subdivision of expected size 0(n/K)) for both 
the input and query points takes time at most 0(nQ PL (n)). Observe here that the input and query points have 
been pre-sorted, and so the rectangles can also be pre-sorted in linear time. Constructing the conflict lists 
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by the breadth-first searches takes expected time 0(n) since they have expected size 0(n) by Lemma l4~3t b). 
Also by Lemma l4~37 a). every conflict list has size 0(K lg n) w.h.p.; if this condition is violated, we can afford 
to switch to a trivial polynomial upper bound on the running time. Since the total expected size of the 3-d 
dominance reporting subproblems at the cells is 0(n), these subproblems can be solved in total expected time 
O(nQ {O(Klgn))) = O(nQ (O(lg 2 n))) plus the output size. 

One technicality arises: by our assumption, the coordinates of the input and query points in each sub- 
problem should be pre-sorted first. For the x-coordinates, this can be accomplished by scanning through the 
global sorted x-list, and for each input or query point s in order, appending s to the end of the linked lists for 
the cells s participates in. The y- and z-sorted lists can be similarly dealt with. The time required is linear. 

By Lemma l4~3T c). the probability that a query with output size fej is bad is at most ki/K. Thus, the 
expected number of bad queries is at most k/K for total output size k. After recursing twice, the expected 
number of queries and input points both decrease to 0(k/K). The 0(ralgn + k) algorithm would then finish 
in expected time 0((k/K) lg n + k) = 0(k). We conclude that our algorithm runs in overall expected time 
O(n[Q PL (n) + Q (O(lg 2 n))] + k). 

For example, we can use Q PL (n) = O(lglgn) by the point location method from [14] (actually in the 
offline setting, we can just use a plane sweep algorithm with a dynamic van Emde Boas trees), and Qo(n) = 
O(lgn) by a known method for 3-d offline dominance reporting [36 1 . Then our algorithm would run in 
expected time 0(n lg lg n + k). We now show that an even better result is possible when n is small. 

The case of few points. First consider the case n < w°^\ By Lemma |4~T1 Q PL (n) = O(l). We can solve 
3-d dominance reporting for 0(lg 2 n) = o(w/\g w) points in linear time, since after rank space reduction, the 
input can be packed into o(w) bits and the answer can deduced from a word operation. Thus, Qo(0(lg 2 n)) = 
0(1). We therefore get an 0(n + A;)-time algorithm. 

Next consider the case n < 2°^^"\ By Lemma l4~Tl Q PL (^) = 0(1)- Since lg 2 n < w ^, by bootstrap- 
ping with the first case, we can set Qo(0(lg 2 n)) = O(l). We therefore get an 0(n + A;)-time algorithm. 

Theorem 4.4. There is an algorithm for offline 3-d dominance reporting on n input points and n query points 
that runs in expected time 0(n lg lg n + k) if the coordinates have been presorted. The time bound improves 
to 0(n + k) if in addition, n < 2°^). 

4.3 Offline 4-d Dominance Reporting 

We are now ready to present our offline 4-d algorithm. Our algorithm follows the same basic approach 
employed by most data structural upper bounds for orthogonal range searching: we construct a range tree on 
the input points and solve an offline 3-d problem in each node of the tree. Naively, using the 0(n lg lg n + k) 
algorithm from Theorem 14.41 would imply only an 0(n lg n lg lg n + A;) algorithm (which nevertheless is an 
improvement over previous results). We need several additional ideas to achieve the final 0(n lg n + k) result. 

Algorithm. Construct a complete binary tree (range tree) T using the input points ordered by their last 
coordinate as leaves. Associate each query point to the leaf node containing its successor input point w.r.t. the 
last coordinate, and project all input and query points on to the first three dimensions. Each internal node u in 
T naturally defines an offline 3-d dominance reporting problem, using the query points in the right subtree as 
queries (the query points of u), and the input points in the left subtree as input (the input points of u). Clearly 
the combined output of all these 3-d problems constitutes the output for the 4-d problem. 

To speed up the solution of these 3-d problems, our first idea is to use randomized shallow cuttings once 
again, but this time with a different choice of parameter K. Pick a random sample of all the n input points, 
where each point is included with probability 1/K with K := 2*™. For each node u in T, let R u denote the 
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sample of the input points of u. We first compute V(R U ) and W{R U ). We next compute the conflict lists for 
all the cells of VV{R U ) as in Section l4~2t namely, we find the cell of VD(R U ) containing each input point of 
u by point location, and then use breadth-first searches to generate the conflict lists in time linear in their total 
size. For each query point q of u, we find the cell of VV{R U ) containing q by point location. If for a query 
q, there is at least one ancestor node where q is a query point and no cell is found, we say that q is bad. For 
each cell A G VV(R U ), we run the algorithm from Section |4~2~1 to solve the offline 3-d dominance reporting 
subproblem for the input points of u in the conflict list of A and the query points of u inside A which are not 
bad. 

This answers all queries that are not bad in any node. To finish, we recursively solve the offline 4-d 
dominance reporting problem on query points that are bad in at least one node and all the input points, where 
the roles of queries and input points are now reversed. After recursing twice, we terminate by switching to a 
known 0(n lg 2 n + /c)-time algorithm (e.g., B31l56ll58l ). 

Analysis, excluding point location. Our algorithm spends expected time at most 0((n/K) Ign) = o(n) 
to compute V(R U ) and VV(R U ) per level of the tree. The breadth-first searches take expected time O(n) per 
level. By Lemma l4~3l a). every conflict list has size O(Klgn) = 2°(vM w .h.p . ; if this condition is violated 
at any node, we can afford to switch to a trivial polynomial upper bound on the running time. Since the 
total expected size of the 3-d dominance reporting subproblems at the cells is O(n), these subproblems can 
be solved in total expected time 0(n) per level, plus the output size, by applying Theorem 14.41 in the "few 
points" case. One technicality arises: the coordinates of the input and query points in each subproblem should 
be pre-sorted first. As in Section 14.21 this can be accomplished by scanning through the global sorted x-, y-, 
and z-lists in linear time. The total time excluding point location cost is then 0(n) per level, i.e., O(nlgn), 
plus the output size. 

By Lemma l4~3l c). the probability that a query with output size ki is bad at one or more nodes is at most 
ki/K. Thus, the expected total number of bad queries at all nodes is at most k/K for total output size 
k. After recursing twice, the expected number of queries and input points both decrease to 0{k/K). The 
0(n lg 2 n + k) algorithm would then finish in expected time 0((k/K) lg 2 n + k) = 0(k). 

Point location cost. At each node u, we need to perform point locations on the xy-projection of VV(R U ) 
for all input points and query points of u. Unfortunately, the current best offline 2-d orthogonal point lo- 
cation algorithm in general requires O(lglgn) time per query, which would result in suboptimal total time 
0(n lg n lg lg n). We suggest the following key idea: solve all the 2-d point location subproblems collectively, 
by transforming them into one single 3-d problem! 

Specifically, consider point locations for the query points of u; locations of the input points of u can be 
dealt with similarly. The query points for which we must perform a point location in VV(R U ) are precisely 
those in the right subtree of u. These queries lie in a consecutive range of leaves, say li through £j, counted 
from left to right. We now transform each rectangle r = [x\,X2] x [yi, 7/2] in the xy-projection of VV(R U ) 
into the 3-d rectangle r' = [x\, X2] x [2/1,2/2] x [i,j] and collect the set B of all such 3-d boxes over all nodes 
in T. Similarly, we transform each query point q to another 3-d query point q'. If q has coordinates [x, y, z) 
and lies in leaf li, we map q to the point q' = (x, y, i). We then collect the set A of all transformed query 
points, and solve an offline 3-d orthogonal range reporting problem with the points in A and the boxes in B. 
From the output of this offline problem, we can obtain for each query point in A the set of boxes in B that 
contain it. This gives the answers to all the original 2-d point location queries. 

Since the subdivisions VV{R U ) have total expected size 0(n/K) per level of the tree, the expected 
number of boxes in B is 0{(n/K) lgn). On the other hand, the number of points in A is n, and the total 
output size of the 3-d problem is 0{n\gn), since each point in A lies in O(lgn) boxes in B. By applying 
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Lemma l4~2l with b = K e , we can solve the offline 3-d orthogonal range reporting problem in expected time 

0(n Igl n + b 2 (n/K) lg n lg| n + n lgn) = 0(n(lgn/lgiv") 2 + nlgn) = O(nlgn), 
due to the fortuitous choice of K = 2*™. We finally conclude 

Theorem 4.5. There is an algorithm for offline 4-d dominance reporting on n input points and n query points 
that runs in expected time 0(n lg n + k), where k is the total output size. 

4.4 Remarks 

Background on shallow cuttings. Matousek |48l provided a deterministic version of shallow cuttings sat- 
isfying similar, slightly stronger properties as in Lemma 14.31 (without the extra logarithmic factor in (a) and 
with probability 1 in (c) for k < K). Originally, shallow cuttings were developed for halfspace range report- 
ing and defined in terms of arrangements of planes rather than orthants; the first application to dominance 
range reporting was proposed by Afshani |Q/|. For our offline problem, however, preprocessing cost matters 
and the above simpler randomized version is more suitable than its deterministic counterpart. Note that it is 
not advisable to use shallow cuttings in 4-d directly, since the number of vertices in the staircase polyhedron 
V(R) can be quadratic in \R\ in 4-d. 

Higher dimensions and applications. The d-dimensional problem reduces to the (d — 1) -dimensional 
problem at the expense of a logarithmic factor increase, by standard divide-and-conquer. Theorem 14. 5 1 thus 
implies: 

Corollary 4.6. There is an algorithm for offline d-dimensional dominance reporting on n input points and n 
query points that runs in expected time 0(n lg d ~ 3 n + k)for any constant d > 4, where k is the total output 
size. 

Our method also works for offline dominance emptiness; the same bounds hold with k set to 0. In fact, the 
algorithms can be slightly simplified: a query point that dominates no input points is good with probability 1 
by Lemma l4~3"l c). and so there is no need to recurse on the bad queries. 

The problem of reporting enclosure pairs for <i-dimensional boxes immediately reduces to (2d)- 
dimensional dominance reporting. The d-dimensional maxima problem obviously reduces to answering n 
d-dimensional offline dominance emptiness queries. 

For a less obvious application, consider the computation of the Loo-minimum spanning tree of n points. 
A reduction by Krznaric, Levcopoulos, and Nilsson [42] showed that this problem can be reduced to the 
bichromatic Loo-closest pair problem: given n red points and n blue points, find a pair of red and blue points 
with the smallest Loo -distance. Their reduction does not increase the asymptotic running time, if it exceeds 
nlgn. A randomized optimization technique by Chan [13] showed that the problem can be further reduced 
to the following decision problem, without increasing the asymptotic expected running time: given n red 
points and n blue points and a value r, decide whether the Loo -distance is at most r. By drawing hypercubes 
centered at the blue points of side length 2r, this problem in turn is equivalent to deciding whether some blue 
hypercube contains some red point. Build a grid of side length r. We can assign points and hypercubes to grid 
cells via hashing in linear expected time. Inside each cell, the blue hypercubes are d-sided. So, the problem 
reduces to a collection of offline dominance emptiness subproblems with linear total size. 

Corollary 4.7. The maxima problem, the bichromatic Loo-closest pair problem, and the Loo-minimum span- 
ning tree problem in any constant dimension d > 4 can be solved in 0(n lg d_3 n) expected time. 
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A Appendix 

A.l Succinct Rank Queries (Proof of Lemma l23l) 

The proof is rather standard. We will store a "checkpoint" once every S lg n positions in the array: a record 
with £ entries (of lg n bits each) that indicates how many elements of each kind we have prior to that position. 
Then, for each element in the array, we can simply write A[i] and the number of elements equal to A[i] between 
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the last checkpoint and i. This uses 0(lg(S lg n)) bits per element, so it fits our space bound if £ > \/lgn. 
The query simply adds the counter stored with A[i] and the appropriate counter from the last checkpoint. 

If the alphabet is smaller, we employ a 2-level scheme. We store a checkpoint as above every £ lg n 
positions. Additionally, every Slglgn positions, we store a minor checkpoint: a record of Slg(Slgn) = 
0(T> lg lg n) bits which indicates the number of elements of each kind from the last checkpoint to the minor 
checkpoint. A query retrieves the appropriate counters from the last checkpoint and the last minor checkpoint, 
and then must solve the rank problem between the last checkpoint and the query position. Since £ lg lg n < 
^/lg n ■ lglgn, the array entries between minor checkpoints fit in 0{yj\gn ■ lg 2 lgn) bits. Thus, we can 
simply store the array entries in plain form, and use a precomputed table of space n ^ to answer rank queries 
between minor checkpoints in constant time. 

A.2 Offline 2-d Orthogonal Point Location for Few Points (Proof of Lemma I47T1) 

The proof follows the bit-packing approach of Chan and Patra§cu [16]. First scan through the sorted input 
lists to reduce all coordinates to rank space, that is, every coordinate of a query and rectangle is an integer of 
value O(n). 

We reduce our problem to a number of 1-d disjoint-intervals stabbing problems (given a set of points and 
disjoint intervals, return for each point the interval containing it if the interval exists). Essentially we construct 
a segment tree on the rectangles (where we divide according to x-coordinates) and solve a 1-d problem in each 
node: Consider a trie of depth 0(lg n) over the binary alphabet. For each rectangle r = [x\, X2] x [y\, 2/2], 
let i\ and £2 denote the leaves corresponding to the binary representation of x\ and X2. Now consider the two 
paths from these leaves to their lowest common ancestor. For each node u on the path to £\ where £\ lies in 
the left child's subtree, associate the interval [2/1,2/2] to the right child. For £2, do the same, but with the roles 
of left and right reversed. Observe that the 2/-intervals associated with each node in the trie are disjoint, by 
the disjointness of the rectangles. Our first task is to compute for each node in the trie, a sorted list of the 
associated 2/-intervals, where the list has been packed into words to allow 0(w/ lg n) consecutive intervals to 
be stored in one word. 

We construct these lists essentially by external-memory radix sorting. We start at the root node where 
we are given the complete input set S in sorted order of bottom 2/-coordinates. We scan over this list and 
distribute the rectangles to two sets, Se and S r , one for the left child of the root, and one for the right. These 
lists are again packed into words. The set Se contains those rectangles [xi,X2] x [2/1,2/2] f° r which the left 
child lies on the path from the root to either of the leaves corresponding to the binary representation of x\ or 
X2- The set S r is similar. Observe that this can be determined directly from the binary representation of x\ 
and X2- Furthermore, if [x\, X2] completely contains the range of x-coordinates stored in the leaves associated 
with either the left or the right subtree, we append [2/1,2/2] to a list stored for the root of that subtree. These 
lists are also packed into words. Finally, we recurse on the left and right subtree, using Sg and S r as input 
respectively. 

Since the rectangles in S are given in sorted 2/-order and the 2/-intervals associated with each node are 
disjoint, this correctly constructs the desired lists. Furthermore, observe that we can handle all 0(w/\gn) 
rectangles stored in one word in O(l) time using table lookups. Thus we spend 0{{n\gn) /w) time on each 
of 0(lg n) levels of the trie to construct the desired lists; the total time is 0((n lg 2 n)/w). 

We now associate each query (x, y) to the set of nodes on the path from the root to the leaf corresponding 
to the binary representation of x. Using the same approach as for the rectangles, we obtain a 2/-sorted list 
of the associated queries in each node of the trie in total time 0((n lg 2 n)/w). To finish, we solve the 1-d 
disjoint-intervals stabbing problem in each node of the trie by scanning the list of associated intervals and the 
list of associated queries simultaneously (in order of 2/-coordinates). Note that we produce output only when 
an interval contains a query point. Using table lookups when performing the scan, we may advance at least 
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one word in one of the lists in 0(1 + k') time, where k' denotes the output size between the queries and the 
intervals in the two considered words. Over the entire trie, the total output size is 0(n) and the total cost is 

0{{n\g 2 n)/w + n) = 0(n) for n < 2°(vM . 

A.3 An Alternative Algorithm for a Special Case of 4-d Offline Dominance Emptiness 

In this subsection, we give an alternative deterministic 0(n lgn)-time algorithm for a special case of 4-d 
offline dominance emptiness: given n red points and n blue points, decide whether there exists a red point 
p and a blue point q such that p is dominated by q. The original 4-d offline dominance emptiness problem 
is stronger: there, we want to know for every blue point q whether there exists a red point dominated by q. 
This special case is sufficient, for example, to solve the bichromatic -closest pair and the -minimum 
spanning tree problem discussed in Section l4~4l however, it is not sufficient to solve the maxima problem. The 
alternative algorithm has the advantage that it avoids "bit tricks", though it requires a nontrivial subroutine — 
an algorithm of Chazelle Ell for intersecting convex polyhedra. 

We first consider the problem in 3-d. It has been observed that techniques for halfspace range searching 
can often be adapted to dominance range searching [ 1|. We first point out an explicit way to reduce dominance 
to halfspace range searching. Surprisingly, this reduction has not appeared before to the authors' knowledge 
(although the idea behind the reduction, which is based on an exponentially spaced grid, is commonplace). 
Specifically, fix a constant r > 3. Assume that all the points have positive integer coordinates (we can initially 
sort the coordinates once at the beginning and reduce to rank space). Transform each red point p = k) 
to the point p* = (r l ,r^,r k ). Transform each blue point q = (a, b, c) to the halfspace q* = {(x,y,z) : 
x/r a + y jr h + z/r c < 3}. It is easy to see that p is dominated by q iff p* lies inside g*: if i < a, j < b, and 
k < c, then r % jr a + r J /r fe + r k jr c < 3, but if i > a, j > b, or k > c, then r % jr a + r- 7 jr h + r k jr c > r > 3. 

Let P be the convex hull of the transformed red points and Q be the intersection of the complements of the 
transformed blue halfspaces. Then the answer to our red/blue dominance problem is no iff every transformed 
red point lies in the complement of every transformed blue halfspace, i.e., P lies inside Q, i.e., P n Q = P. 
Chazelle ll22l has given a linear-time algorithm for intersecting two convex polyhedra. Thus, the 3-d problem 
can be solved in linear time, provided that the polyhedra P and Q are given. 

Now, to solve the 4-d problem, we build a binary range tree T according to the last coordinate as before 
and obtain a series of 3-d subproblems of total size 0(n lg n). Observe that we can pre-compute the red convex 
hulls P at all the nodes of T bottom-up in 0(n lg n) time, by repeatedly using Chazelle's linear-time algorithm 
for merging two convex hulls (computing the convex hull of two convex polyhedra is dual to intersecting two 
convex polyhedra). Similarly, we can pre-compute the blue halfspace intersections Q at all the nodes of T in 
O(nlgn) time, again by repeatedly using Chazelle's algorithm for intersecting two halfspace intersections. 
This immediately gives a solution to the 4-d problem with overall running time 0(n lg n). 

Remarks: Precision issues seem to arise since the coordinates of the transformed points and halfspaces involve 
large numbers, but we can simulate any primitive operation on these points and halfspaces by treating r as a 
symbolic variable that approaches infinity. It would be interesting to see if we can directly merge or intersect 
staircase polyhedra without going through the transformation and invoking Chazelle's algorithm. Note that the 
above approach does not work at all for the offline dominance reporting problem, or for that matter, the offline 
dominance emptiness problem (in the 3-d subproblem, we do not know the answer for any non-maximal blue 
query point whose halfspace does not appear on dQ). 
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